Innovation flow through social networks: Productivity 

distribution 

T. Di MattecO and T. Aste 
O ■ Department of Applied Mathematics, 

CN \ Research School of Physical Sciences and Engineering, 

Ch ■ 

^ ' The Australian National University, Canberra ACT 0200, Australia. 



a^ 



Oh' 



M. Gallegati 

Department of Economics, Universita Politecnica delle Marche, 

Piaz.le Martelli 8, 1-60121 Ancona, Italy. 



O ; (Dated: February 2, 2008) 



Abstract 
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^^1 A detailed empirical analysis of the productivity of non financial firms across several countries 

O i' and years shows that productivity follows a non-Gaussian distribution with power law tails. We 

demonstrate that these empirical findings can be interpreted as consequence of a mechanism of 

exchanges in a social network where firms improve their productivity by direct innovation or/and 

f~^ ' by imitation of other firm's technological and organizational solutions. The type of network- 

f^ ■ connectivity determines how fast and how efficiently information can diffuse and how quickly 

^. 

^D . innovation will permeate or behaviors will be imitated. From a model for innovation fiow through 

O ' a complex network we obtain that the expectation values of the productivity level are proportional 

• 1— I ' 

C/3 ' 

J>^' to the connectivity of the network of links between firms. The comparison with the empirical 

^' 
CLc distributions reveals that such a network must be of a scale-free type with a power-law degree 

, , , distribution in the large connectivity range. 

>< 
. 5^ ; PACS numbers: 89.65.Gh, 89.75.Hc, 89.75.-k, 89.75.Da. 
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I. INTRODUCTION 

Recently, the availability of huge sets of longitudinal firm-level data has generated a 
soars of productivity studies i„ the ecouom.c literature llfl&flflBa. There are several 

measures of productivity |8|], in this work we consider two basic measures: labour and 
capital productivity. The Labour productivity is defined as value added over the amount 
of employees (where value added, defined according to standard balance sheet reporting, 
is the difference between total revenue and cost of input excluding the cost of labour). 
Although elementary, this measure has the advantage of being accurately approximated 
given the available data. The other alternative measure is the capital productivity which 
is defined as the ratio between value added and fixed assets (i.e. capital). This second 
measure has some weakness since the firms' assets change continuously in time (consider 
for instance the value associated with the stock price). Usually the literature recognizes 

n 

that the productivity distribution is not normally distributed frl, and empirically 'fat tails' 
with power law behaviors are observed. But the mainstream proposed explanations cannot 
retrieve this power law tails yieldin g -a t best- to log-normal distributions J9|,|l0|. According 
to the evolutionary perspective [UlllSl) firms improve their productivity implementing new 
technological and organizational solutions and, by this way, upgrading their routines. The 
search for more efficient technologies is carried out in two ways: (1) by innovation (direct 
search of more efficient routines); (2) by imitation of the most innovative firms |l3l.ll4|. In 
practice, one can figure out that once new ideas or innovative solutions are conceived by 
a given firm then they will percolate outside the firm that originally generated them by 
imitation from other firms. In this way the innovation flows through the firms. Therefore, 
the network of contacts between firms which allows such a propagation must play a decisive 
role in the process. 

In this paper we introduce a model for the production and flow of innovation in a complex 
network linking the flrnis. We show that the resulting productivity distribution is shaped by 
the connectivity distribution of this network and in particular we demonstrate that power 
law tails emerge when the contact-network is of a scale-free type. These theoretical flnding 
are corroborated by a large empirical investigation based on the data set Amadeus, which 
records data of over 6 million European flrms from 1990 to 2002 jl5| . A statistical analysis of 
such a data reveals that: (i) the productivity is power law distributed in the tail region; (ii) 



this result is robust to different measures of productivity (added value-capital and capital- 
labor ratios); and (iii) it is persistent over time and countries [l5|. A comparison with the 
theoretical prediction reveals that the empirical data are well interpreted by assuming that 
the contact network is of scale-free type with power law tailed degree distributions. 

The paper is organized as follows: Section IH] recalls the concept of social network; Section 
Uni introduces the model supporting the technological distribution while Section HVl describes 
the empirical findings. A conclusive section summarizes the main results. 

II. CONTACT NETWORKS IN SOCIAL SYSTEMS 

Systems constituted of many elements can be naturally associated with networks link- 
ing interacting constituents. Examples in natural and artificial systems are: food webs, 
ecosystems, protein domains, Internet, power grids. In social systems, networks also emerge 
from the linkage of people or group of people with some pattern of contacts or interactions. 
Examples are: friendships between individuals, business relationships between companies, 
citations of scientific papers, intermarriages between families, sexual contacts. The relevance 
of the underlying connection-network arises when the collective dynamics of these systems 
is considered. Recently, the discovery that, above a certain degree of complexity, natural, 
artificial and social systems are typically characterized by networks with power-law distri- 



butions in the number o 
scientific interest lla, 17 



links per node (degree distribution), has attracted a great deal of 



18|. Such networks are commonly referred as scale-free networks 
and have degree distribution: p^ ~ k~°' (with p^ the probability that a vertex in the network 
chosen uniformly at random has degree k). In scale- free networks most nodes have only a 
small number of links, but a significant number of nodes have a large number of links, and 
all frequencies of links in between these extremes are represented. The earliest published 
example of a scale-free network is probably the study of Price jl^ for the network of ci- 
tations between scientific papers. Price found that the exponent a has value 2.5 (later he 
reported a more accurate figure of a = 3.04). More recently, power law degree distributions 
have been observed in several networks, including other citation networks, the World Wide 
Web, the Internet, metabolic networks, telephone calls and the networks of human sexual 
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All theses systems have values of t 



between 0.66 and 4, with most occurrences between 2 and 3 Ul 



le exponents a in a range 



m 



When analyzing the industrial dynamics, it is quite natural to consider the firms as 
interacting within a network of contacts and communications. In particular, when the 
productivity is concerned, such a network is the structure through which firms can imitate 
each-other. Our approach mimics such a dynamics by considering simple type of interactions 
but assuming that they take place through a complex network of contacts. 

III. INNOVATION FLOW 

The innovation originally introduced in a given firm 'i' at a certain time t can spread 
by imitation across the network of contacts between firms. In this way, interactions force 
agents to progressively adapt to an ever changing environment. 

In this section we introduce a model for the fiow of innovation through the system of firms. 
We start from the following equation describing the evolution in time of the productivity xi 
of a given firm '/': 

xiit + 1) = xi{t) + Ai{t) + J2 Qj^iit)[xjit) - x,{t - 1)] (1) 

t-i 
-Y,<ll^\t)[xi{t-r)-x,{t-r-l)]. 

T=l 

The term Ai{t) is a stochastic additive quantity which accounts the progresses in productivity 
due to innovation. The terms Qj^i are instead exchange factors which model the imitation 
between firms. These terms take into account the improvement of the productivity of the firm 
'/' in consequence of the imitation of the processes and innovations that had improved the 
productivity of the firm 'j' at a previous time. Such coefficients are in general smaller than 
one because the firms tend to protect their innovation content and therefore the imitation is 
-in general- incomplete. In the following we will consider only the static cases where these 
quantity are independent on t. The term q^ is: 

qi'^ = Y^Q.^iQi^, forr = l (2) 

it^ = E ^^--^ E Qi^h,Qh,^h, ■ ■ ■ Qk^,^, for r > 2. (3) 

ieX; hi...hr-i 

This term excludes back-propagation: firm 7' imitates only improvements of the productivity 
of firm 'j' which have not been originated by imitation of improvements occurred at the firm 



'/' itself at some previous time. The system described by Equation Q] can be viewed as a 
system of self-avoiding random walkers with sources and traps. 

The probability Pt+i{y, l)dy that the firm / at the time t + 1 has a productivity between y 
and y + dy is related to the probabilities to have a set {Qj^i} of interaction coefficients and 
a set of additive coefficients {Ai(t)} such that a given distribution of productivity {xj{t)} at 
the time t yields, through Equation [TJ to the quantity y for the agent / at time t + 1. This 
is: 

daAtia,l)l[ / dxPPt^^ix[^\l)--- (4) 

-OO /: r\ J —CO 



OO 
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dx'^>P,_^ix'^\N) 



t-i 



j<^Il r=l 

where S{y) is the Dirac delta function and At{a, I) is the probability density to have at time 
t on site I an additive coefficient Ai{t) = a. Let us introduce the Fourier transformation of 
Pt{y, I) and its inverse 



/OO 
dye^^y^^PtiyJ) 
-OO 



-OO 

r>00 



PtiyJ) = ^ I d^e~^y^p,{v,i) . (5) 

In appendix we show that Equation 0] can be re- written in term of these transformations, 
resulting in: 

t-i 

K,{V,l) = M^J)Pt{^,l)l[Pt-^{{-qP+qt'^)v,l) 

Po(gf-'V,/)A-i(-grV,0 (6) 

Y[Pt{Qj^i(pJ)Pt-i{-Qj^iip,j) , 
jeTi 

with At{f,l) being the Fourier transform of At{a,l). From this equation we can construct 
a relation for the propagation of the cumulants of the productivity distribution. Indeed, by 
definition the cumulants of a probability distribution are given by the expression: 

kl^\t) = {-zr^\nP,{^J) ^, (7) 
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where the first cumulant k\ (t) is the expectation value of the stochastic variable xi at 

the time t ({xi{t))) and the second cumulant A;} (t) is its variance {af{t)). By taking the 

logarithm of Equation IHl and applying Equation [7| we get: 

t-i 



kt\t+i) = c^'^\t) + k[''\t) + j2i<it'^-^Pn['\t-o 



(8) 
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J2i(Q^~*^y^f^(t) + {-Q,^irkf\t - 1)] . 

It has been established by Maddison that the average innovation rate of change in the 



OECD countries since 1870 has been roughly constant 

{Mt + I)) - {A,{t)) 



27j . In our formalism this implies 
const. (9) 



Therefore, the mean of the additive term in Equation^ ((^^(t))) must grow exponentially 
with time and consequently the first cumulant (the average indeed) reads: c'-^-* = Cq {c\ )*. 
Equivalently we assume an exponential growth also for the other moments {c-'^' = Cq {cf Y). 
Equation |H1 can now be solved by using a mean-field, self-consistent solution (neglecting 
correlations and fluctuations in the interacting firms) obtaining: 
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kr\t) = ^f- 
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for V = 1 
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for V > 1 



(10) 



where 
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(11) 

(12) 



(13) 
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with Q being the average exchange factor. When this exchange term is small, Equation ITUl 
can be highly simplified by taking the first order in Q only, leading to: 
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c\ — 1 

Equation^] (and its simplified form f Equation I15|)) describes a mean productivity which 
grows at the same rate of the mean innovation growth (as a power of c]^ ) and is directly 
proportional to the number of connections that the firm has in the exchange network. From 
Equation El we also have that all the cumulants increase with a corresponding power rate 
((^1 )*)• But, if we analyze the normalized cumulants: \^'^''{t) = k^ (t)/[ki (t)Y''^ we 
immediately see that at large t they all tend to zero excepted for the mean and the variance. 
Therefore the probability distributions tend to Gaussians at large times. 

Summarizing, in this section we have shown that, at large t, the expectation value of 
the productivity level of a given firm is proportional to its connectivity in the network 
of interaction and the fiuctuations around this expect at ion- value are normally distributed. 
Each firm has a different connectivity and therefore the probability distribution for the 
productivity of the ensemble of firms is given by a normalized sum of Gaussians with averages 
distributed according with the network connectivity. As discussed in the previous section, 
power-law-tailed degree distributions are very common in many social and artificial networks. 
It is therefore natural to hypotheses that also the social/information network through which 
firms can exchange and imitate productivity has a degree distribution characterized by 
a power law in the large connection-numbers region. If this is the case, then the whole 
productivity distribution will show a power-law tail characterized by the same exponent of 
the degree distribution J28|. 

IV. EMPIRICAL ANALYSIS AND COMPARISON WITH THEORY 

Figures HJ 121 El and m show the log-log plot of the frequency distributions (Left) and the 
complementary cumulative distributions (Right) of labour productivity and for capital pro- 
ductivity measured as quotas of total added value of the firms. In these figures the different 
data sets correspond to different years: 1996 — 2001. For the sake of exposition, we illustrate 
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FIG. 1: Frequency distributions (Left) and complementary cumulative distributions (Right) for 
the labour productivity in Italy in the years 1996-2001. The theoretical behavior is for a = 2.7, 
m = 22, n = 11, a = 10 and /? = 3. 
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FIG. 2: Frequency distributions (Left) and complementary cumulative distributions (Right) for 
the labour productivity in France in the years 1996-2001. The theoretical behavior is for a = 2.1, 
m = 30, n = 4, a = 20 and /3 = 1. 



the productivity distribution for France and Italy only, but similar results have been obtained 
for other Euroland countries of the AMADEUS dataset. The frequency distributions show 
a very clear non-Gaussian character: they are skewed with asymmetric tails and the labour 
productivity (Figures^and |21(Left)) present a clear leptokurtic pick around the mode. The 
complementary cumulative distributions (P>(x), being the probability to find a firm with 
productivity larger than x) show a linear trend at large x implying a non-Gaussian character 
with the probability for large productivities well mimicked by a power-law behavior. 

The model presented in this paper gives a simple explanation for the occurrence of 
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FIG. 3: Frequency distributions (Left) and complementary cumulative distributions (Right) for 
the capital productivity in Italy in the years 1996-2001. The theoretical behavior is for a = 3.8, 
m = 0.04, n = 0.02, a = 0.01 and (i = 25. 

such power law tails in the productivity distribution: they are a consequence of the so- 
cial/information network which is of "scale-free" type (analogously with several other com- 
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plex systems where such a connectivity-distribution can be measured 
Indeed, we have shown that distribution for the productivity of the ensemble of firms is given 
by a normalized sum of Gaussians with averages distributed according with the network con- 
nectivity. As consequence, when the connection network is of scale-free type the productivity 
distribution must share with it the same exponent in the power-law-tail. 

Comparisons between the theoretical predictions from Equation ^1 associated with a 
scale- free network and the empirical findings are shown in the Figures^ 121 El and 0] (Right). 
In particular, accordingly with Equation ^l we assume an average productivity given by 
k\ = m + zin, a variance equal to a and the degree distribution of the network given by pk oc 
fc~"exp(— /3/A;). The agreement with the empirical findings is quantitatively rather good. 
We note that, although there are several parameters, the behavior for large productivity is 
controlled only by the power-law exponent —a. On the other hand, in the small and the 
middle range of the distribution the other parameters have a larger influence. 

From our analysis we observe that the theoretical curves fit well the empirical findings 
by assuming the power law exponent equal to a = 2.7 and 2.1 for the labour productivity in 
Italy and France respectively. These exponents are in good agreement with the ones typical 
of the degree distribution in social networks. On the other hand the capital productivity 
presents much steeper decays which can be fitted with exponents 3.8 and 4.6 respectively. 



10 

N(Ax) 




Q 2001 
2000 



1997 
1996 



10 10 V 







AjTlAVilri.. 


' 




10°< 

PJx) 




O 2001 

2000 

1999 

1998 




^ 


10"' 




1 




1997 
■ 1996 
^— Theoretical 


10-' 


\ a=4_6 


: 


10"' 






V/ 


10-" 


France 


^L 


10-' 






^^9k 



FIG. 4: Frequency distributions (Left) and complementary cumulative distributions (Right) for 
the capital productivity in France in the years 1996-2001. The theoretical behavior is for a = 4.6, 
m = 0.06, n = 0.02, a = 0.4 and (3 = 68. 

These very high values of the exponents might be consequence of the irrational euphoria 
of the late 90es when the stock markets were hit by a speculative bubble (1997) and its 
subsequent crash (2000). The bubble increased the value of the firms' asset thus lowering 
the value added-capital (i.e. capital productivity) ratio and soaring the power law coefficient 
of the power law distribution of the capital productivity distribution. However the very high 
capital productivity regions show a slowing down which could be fitted with lower exponents. 

V. CONCLUSIONS 

In this paper we have shown that the productivity of non-financial firms is power law 
distributed. This result is robust to different measures of productivity, different industrial 
sectors, years and countries. We have also argued that the empirical evidence corroborates 
the prescription of the evolutionary approach to technical change and demonstrated that 
power law distributions in productivity can be interpreted as consequence of a simple mecha- 
nism of exchanges within a social network. In particular, we have shown that the expectation 
values of the productivity level are proportional to the connectivity of the network of links 
between firms. The comparison with the empirical data indicates that such a network is of 
a scale-free type with a power-law degree distribution. In the present formulation we have 
assumed an underlying network which is fixed in time. This allows obtaining equilibrium 
solutions. On the other hand, a more realistic analysis should consider a non-static underly- 
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ing network and therefore non-equilibrium trajectories modulated by the fluctuation in the 
underlying network. This non-equilibrium dynamics can be studied numerically from Equa- 
tion [T] by using fluctuating exchange coefficients Qj^iit) . This is left to future research. In 
this paper we had a narrower goal: to show that empirical evidence is very well fitted by 
the evolutionary view of technical change. 
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APPENDIX A: CUMULANT PROPAGATION 

By using the Fourier transformation (Equation Ej), Equation |^ becomes: 

Pt+i{y,l) = da\kt{a,l)W\-—^ dx? ■ ■ ■ t/xg^ (Al) 



r 


dalktip. 


t-1 

•"11 [i 


1 


/ -OO 


dxf ■ 


J — OO 


dx^S 








J —OO 


2txY 




J — CXD 


7 (■£■) jr<^'«<^' A 




1)- 


J ^oo 


d^^^e 




Pt^ 


-c(^S\ 


N)_ 


1 
27rj 


— OO 


■1 (0) 


"^J6I; 


\^f- 


-riQ,- 


.i^y:-2 


,r'[-r'- 


-r 


+^'])<^| 





where the Dirac delta function has been written as 

%-2/o) = ^y rf0e-^(^-^»)^ . (A2) 

Equation lAll can be re- written as: 

P,+i(y, = 7^ y da\k,{a, I) j dcPe-^^y-^^'t' (A3) 
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The integration over the x's yields 

fOO 



Pt+iiy,l) = ^l da\At{a,l) I d<p\e-'^y-''^^Pti<P,l) (A4) 



t-i 



nA(g,^/0,j)A-i(-g,^i0,j) 

Its Fourier transform is: 

-1 /»00 /"OO /»oo 

Pt^,(^J) = dalAtiaJ) rf0e*"M rfye'^^^'^-'^) (A5) 

t-1 
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Equation IA5I can be integrated over y giving the Fourier transform of Equation 0] which is 
Equation ini in Section UTTl 
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